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LEUKAEMIA INHIBITORY FAOTQT^ 

This invention, relates to Leukaemia Inhibitory 
Factor, to ^ its production and uses. 

Leukaemia Inhibitory Factor (LIF) is a secreted , 
soliible polypeptide regulatory factor which has the 
5 characteristic of specifically suppressing the 

differentiation, and maintaining pluripotentiality, of 
murine embryo-derived pluripotential embryonic stem cells. 

Experiments using ES cell lines established 
directly from the inner cell mass of the early mouse embryo 
10 (Evans and Kaufman, 1981; Martin, 1981), suggest that LIF 
is a normal embryonic regulatory factor. In the presence 
of LIF, also known as DIA (dif f erentiation-.inhibiting 
activity) , ES cells can be cultured and maintained in vitro 
without loss pluripotentiality. Even after many 
15 generations in vitro , these cells can be reintroduced into 
the mouse blastyocyst where they may contribute 
differentiated progeny to all tissues including the germ 
line (Bradley et ai 1984). 

EP-A-285 448 discloses the complete amino acid 
20 sequence of a form of murine LIF and also discloses a 

sequence, incomplete at the N-terminus, of the human form 
of this factor. The murine LIF disclosed by EP-A-285 448 
has the N-terminal sequence NH2-met-lys-val-leu-ala-ala-gly 
(KVL-LIF) . 

25 Stahl et al (J.Biol Chem. , 265(15), 8833 (May 1990)) 
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disclose the genomic sequence of human LIF. 

We have now surprisingly foimd that certain cell 
lines which exhibit DIA-LIF activity do so via a novel form 
of LIF which has the N-terminal sequence NH2-inet-arg-cys- 
5 arg (RCR-LIF) • This form of LIF occurs as a result of a 
splicing event which joins a new 5'exon to the second and 
third exons of the previously described form of iJF.- This 
alternative splicing thus replaces the first seven araino 
acid residues of KVL-LIF with the first four amino acids of 

10 RCR-LIF. The result of this alteration in the amino 

terminus of the LIF protein results in the incorporation of 
RCR-LIF into the extracellular matrix (ECM) . RCR-LIF is 
therefore a localised, as opposed to diffusible, 
determinant of differentiation, 

15 As a result of our findings, we have identified two 

proteins, laminin-)32 and e-cadherin, which form part of the 
normal ECM which have, at or near their N-termini, the 
sequence: 

-arg-J-arg- 

20 where J- is glycine in laminin-j92 and cysteine in e- 

cadherin. However, since these proteins share many other 
sequence similarities, it was not previously apparent 
whether any part of these proteins - and if so, which - was 
responsible for their presence in the ECM. 

25 We have found that the localization of RCR-LIF can 

be demonstrated by the inability of this form of LIF, when 
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associated with an ECM, to diffuse through a microporous 
membrane.' As demonstrated in the following examples, a 
cell line^ responsive to the action of LIF, such as a 
pluripotent cell line (for example' the ES cell line 
5 mentioned above) , will only respond to an ECM containing 
this form of LIF when in direct contact with the ECM. The 
presence of a microporous membrane between an ECM 
containing RCR-LIF and LIF-responsive cells will block the 
action of RCR-LIF on the cells, but not the action of the 

10 diffusable KVL-LIF, which is not retained by the ECM. The 
RCR-LIF thus remains associated with the ECM. In the 
absence of a physical barrier such as a microporous 
membrane, both forms of LIF act upon LIF-responsive cells. 

RCR-LIF is thus distinguished from KVL-LIF by the 

15 following criteria: 

1. LIF activity is physically confined* to the 
extracellular matrix (ECM) and is thus only acts upon cells 
which are in direct physical proximity to ECM containing 
RCR-LIF. 

20 - 2. ECM associated RCR-LIF source has superior 

stability to KVL-LIF. 

3. RCR-LIF activity is confined to cells which 

are capable of physically associating with the RCR-LIF 

associated ECM preparation and therefore its range of 
25 biological activities in vivo and in vitro can be targeted 

more sp cifically than KVL-LIF. 
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4, RCR-LIF can be presented in a more 
concentrated form (as it is not freely diffusible) with 
improved biological efficacy compared to KVL-LIF derived 
from non-ECM associated sources. 
5 In addition, RCR-LIF as defined by these parameters 

has the following properties: 

1. Enhances the survival of sympathetic and CNS 
derived neurons in vivo and in vitro > 

2. Regulates the expression of neurotransmitters, 
10 for example, choline acetyltransf erase and acetylcholine 

esterase, in neuronal preparations. 

3. Induces the proliferation of cells of the 
osteoblast lineage in vivo and in vitro . 

4. Induces activation of osteclastic cells in 
15 osteogenic tissues, 

5. Supresses the expression of lipoprotein lipase 
in cells of the adipocytic phenotype in vivo and in vitro , 

6. Induces the expression of acute phase response 
proteins in hepatocytes or hepatocyte derived cell types. 

20 -7. Enhances the viability and multiplication of 

keratinocytes in vitro, 

8. Maintains the proliferation of haemopoetic 

cells. 

9. Induces the differentiation of leukaemic 

25 cells. 

10. Enhances the viability and multiplication of 
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germ cells. 

The novel N-terminal sequence of the LIF of the 

present invention pVovides a signal sequence for directing 

and anchoring molecules such as proteins or peptides to a 

5 mammalian ECM. Therefore, the tetrapeptide sequence NH2- 

met-arg-cys-arg may be attached by chemical or recombinant 

, 

means to proteins or peptides to cause the molecules to 

become attached to or associated with the ECM. C-terminal 

derivatives of this tetrapeptide sequence corresponding to 

10 further LIF sequence, eg. NH2-met-arg-cys-arg-ile-val, may 
also be used. It is to be understood that changes in the 
tetrapeptide sequence NH2-Met-Arg-Cys-Arg which do not 
substantially alter the ability of this sequence to direct 
association with the ECM are within the scope of the 

15 invention. 

The LIF of the present invention can be used for 
the propagation of embryonic stem (ES) cells and for the 
maintenance of ES cell pluripotentiality (as defined by the 
ability to form functional gametes in chimeras) . In 

2 0 addition, LIF of the invention and N-terminal fragments 
thereof, and polyclonal or monoclonal antibodies or 
fragments thereof against LIF and its N-terminal fragments 
have potential utility for the following applications: 

1. Induction and suppression of haemopoetic cell 
25 differentiation in vivo and in vitro. 

2. Induction of bone and tooth deposition and 
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resorption in vivo. 

* 

3. Regulation of acute phase response protein 
gene expression in liver cells. 

4. Regulation of cachexia. 

5 5. Regulation of neuron viability and 

differentiation. 

6. Acceleration/Inhibition of wound healing. 

7. Induction and inhibition of epithelial cell 
proliferation and differentiation. 

10 The present invention thus provides a novel LIF 

which is capable of associating with a mammalian 
extracellular matrix. In one aspect of the invention, this 
novel LIF is characterised by an N-terminal sequence NH2- 
met-arg-cys-arg . 

15 The invention also provides novel forms of LIF 

containing sequences which are substantially identical to 
the sequence met-arg-cys-arg which are capable of causing 
LIF to be associated with the ECM. In particular, 
homologues of this LIF sequence in which 2 of the first 5 

20 amino acid residues from the N-terminus are substituted 
are within the scope of the invention. 

A further embodiment of the invention provides a 
DNA coding for any of these novel forms of LIF, N-terminal 
signal sequences, or recombinant proteins or peptides 

25 described above. The DNA may be used to produce a labelled 
probe by conventional means using radioactive or non- 
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radiaoactive labels, or the DNA may be cloned into a 
vector. 

A further embodiment of the invention provides 
vectors for the replication and expression of the said DNA. 
5 The vectors may be, for example, plasmid, virus or phage 
' vectors provided with an origin of replication, optionally 
a promoter for the expression of the said DNA zmd 
optionally a regulator of the promoter. The vector may 
contain one or more selectable marker genes, for example an 

10 ampicillin resistance gene in the case of a bacterial 
plasmid or a neomycin resistance gene for a mammalian 
vector. The vector may be used in vitro , for example for 
the production of RNA corresponci ing to the DNA, or used to 
transfect or transform a host cell. 

15 A further embodiment of the invention provides host 

cells transformed or transfected with the vectors for the 
replication and expression of the said LIF DNA. The cells 
will be chosen to be compatible with the vector and may for 
example be bacterial, yeast or mammalian. 

20 The invention also provides N-terminally directed 

monoclonal or" polyclonal antibodies, or fragments thereof, 
to the novel proteins or peptides, or N-terminal fragments 
thereof, of the invention. 

The invention also provides a process for the 

25 production of monoclonal or polyclonal antibodies to the 
novel proteins or peptides of the invention. Monoclonal 
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antibodies may be prepared by conventional hybridoma 
technology using the noyel proteins or peptides, or 
fragments thereof, as an immunogen* Polyclonal antibodies 
may also be prepared by conventional means which comprise 
5 innoculating a host animal, for example a rat or a rabbit, 
with a protein or peptide of the invention and recovering 
immune serxim. 

The present invention also provides pharmaceutical 
compositions containing the LIF proteins or peptides of the 

10 invention, or fragments thereof, and antibodies or 

fragments thereof to the proteins, peptides and fragments 
thereof for the treatment and/or regulation of conditions, 
including proliferative diseases, associated with abnormal 
(eg at an unusually high or low level) and/or aberrant (eg 

15 due to a mutation in the gene sequence) expression of LIF 
in a mammal, including man, or for the treatment of 
conditions or diseases of a mammal, including man which 
benefit from an alteration in naturally occuring levels of 
LIF, 

20 The invention also provides the above protein, 

peptides and antibodies and fragments thereof, and DNA 
. coding for the said proteins and peptides (or fragments 
thereof) for the therapy or diagnosis of conditions, 
including proliferative diseases, associated with abnormal 

25 and/or aberrant expression of LIF in a mammal, including 
man. 
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In addition, the invention provides peptides of the 
sequence NH2-inet-arg-cys-arg, C-tenninal derivatives 
thereof, eg NHj-met-arg-cys-arg-ile-val, and substantially 
identical sequences- as defined above for use as N-tenninal 
5 signal sequences for naturally occurring molecules lacking 
such a sequence or any fona of recombinant molecules. Such 
sequences will cause such molecules, eg proteins or 
peptides, to become associated with the ECM. 

Thus, the present invention thus further provides 
10 recombinant protein or peptide which comprises, at its N- 
terminus, the sequence: 

NH2-U-arg-X-arg- 
where u is the residue of a peptide of 20 or fewer amino 
acids, or a covalent bond, and X is any amino acid. 
15 Preferably, u is 20 or fewer, eg 10 or fewer 

residues in size. Preferably, X is glycine or cysteine. 
Preferably, the N terminal sequence comprises: 
NH2-U-arg-X-arg-ile-val-pro-; or 
NH2-U-arg-X-arg-ile-val-pro-leu-leu- 
20 where U and X are as defined above, and substantially 
identical variants thereof. It will be understood that 
substantially identical sequences are those which do not 
substantially alter the ability of the N-terminal sequence 
to direct the protein or peptide to which the sequence is 
25 attacted to the ECM. 

The protein or peptide to which the N-terminal 
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sequences of the invention may be attached may, for 
example, be a hormone, growth factor or cytokine. 
Attachment of sequences according to the invention to such 
. proteins and peptides will cause the resulting recombinant 
5 proteins and peptides to become attached to the ECM. This 
will enable the action of these proteins and peptides to be 
localized. It will be appreciated by those of skill in the 
art that*, for example, a recombinant hormone which 
comprises an N-terminal sequence according to the invention 

10 together with all, or an active fragment of, the hormone, 
will still be considered a hormone for the purposes of the 
invention. Similarly, other naturally occuring proteins or 
peptides modified according to the invention will be 
considered equivalent to the native protein or peptide if 

15 their function is substantially unaltered. 

Such modified proteins or peptides may be used, 
where appropriate, in the treatment or therapy of the hioman 
or animal body. 

The invention further provides recombinant DNA 
2 0 enco'ding a protein or peptide according to the invention. 
The invention also provides a recombinant 
replicable expression vector comprising such DNA, a host 
cell transformed or transfected with such a vector, and a 
polyclonal or monoclonal antibody directed against the N- 
25 terminal of a protein or peptide according to the 

invention. Reference may be made to the description above 
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in connection with LIF for suitable vectors, host cells and 
processes for the production of antibodies. 

The present invention also provides a peptide of 
the formula: 

5 NH2-U-arg-X-arg-ile-val-pro-leu-leu- 

vhere U is the residue of a peptide of 20 or fewer amino 
acids or a covalent bond and X is any amino acid, and 
substantially identical variants thereof. 

In addition, the invention provides a method of 
10 causing a protein or peptide to become attached to the ECM 
of a mammalian cell comprising contacting the ECM with a 
protein or peptide having an N'-terminus of the sequence: 

NH2-U-arg-X-arg-; or 

NH2-U-arg-X-arg-ile-val-pro-leu-leu- 
15 where U is the residue of a peptide of 20 or fewer amino 
acids or a covalent bond and X is any amino acid, and 
substantially identical variants thereof. 

Preferably, the variants of all the sequences 
mentioned above will retain the motif "-arg-X-arg" , where X 
20 is as defined above. 

The following examples describe the isolation and 
characterization of the novel ECM-as^ociated LIF of the 
invention from a murine source. However, ECM-associatod 
LIF from other sources, e.g. human or porcine are within 
25 the scope of the present invention and could be isolated in 
an analogous manner. 
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EXAMPLE 1; MA^'ERIALS & METHQn<? 
CELL CULTURE AND BIOLOGICAL ASSAYS Cell culture was 
performed in a humidified 7.5% CO2 atmosphere in DMEM:Ham«s 
F12 (50:50) containing antibiotics, lO^^M 2- 
mercaptoethanol and 10% (v/v) foetal bovine serum (selected 
5 batches). For maintainance of ES cells, medixim was further 
supplemented with a 1/1000 dilution of supernatant 
conditioned by exposure to Cos-7 cells transfectnd with the 
PC10-6R human DIA/LIF expression plasmid (Smith et al,, 
1988), Ehrlich ascites cells and MRC-5 human embryonic 

10 lung fibroblasts were obtained from the Sir William Dunn 
school of Pathology Cell Bank, University of Oxford; BRL 
and STO cells (Smith and Hooper, 1987) and C3H lOTl/2 mouse 
embryo fibroblasts (Edwards et al., 1987) were as 
previously described. Differentiation assays were 

15 performed on the pluripotential ES cell line CPl, 

generously provided by Dr, Martin Evans, Department of 
Genetics, University of Cambridge. Diffusible DIA/LIF was 
assayed by plating ES cells at an initial density of 10"* 
cells/ml/16mm well and culturing for 4 days in the presence 

2 0 of experimental samples. The samples were then fixed and 
stained with Leishman's and inhibition of morphological 
differentiation was assessed by microscopic inspection 
(Smith and Hooper, 1987). 

Feeder layers were prepared by treatment of 

25 confluent cultures with mitomycin C (lO^g/ml) for 2-3 
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hours, followed by' trypsinisation and replating (Martin and 
Evans, 1975) . The collagenous membrane inserts 
(Transwell™, Costar) were transparent with a pore size of 
0.4Mm. 

5 Cell-free ECM preparations were obtained by 

hypotonic lysis with 0.02M NH4OH (Gospodarowicz, 1984) or 
by detachment of cells with 0.02% EDTA in PBS (Rheinwald 
and Green, 1975) • 

DNA MANIPULATIONS. Standard techniques* (Maniatis et al.) 
10 were used for the construction of all plasmids. The LIF 
numbering system used throughout this paper refers to the 
partial length murine cDNA described by Gearing et al. 
(1987). 

A cDNA encompassing the entire murine LIF coding 
15 region (Gearing et al., 1988) was obtained by the 

polymerase chain reaction (PGR) on first strand cDNA 
generated from Ehrlich ascites RNA from Ehrlich ascites 
cells. Oligo dT was. used to prime reverse transcription of 
2.5/ig of poly A"*" RNA (cDNA Synthesis Kit, Amersham) . After 
20 phenol/ chloroform extraction, RNA was hydrolysed by 

incubation at 4* C overnight in 0.25M NaOH, the solution 
was neutralised by the addition of HCl, and the cDNA was 
purified by Sephadex G-50 chromatography and ethanol 
precipitation. 20ng of the cDNA was used as a template for 
25 PGR amplification using the mLIF specific primers LIFI 
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5 • • . . AAGAATTCCATAATGAAGGTCTTGG. . . 3 • and LIF2 
5 • . • . AAGAATTCAGTCCATGGTACATTCAAGA. . . 3 • These primers , 
which both contain EcoRI restriction sites, hybridise to 
the mLIF cDNA at residues 11 - 28 and 657 - 636 
5 respectively; PGR conditions were as recommended by Perkin 
Elmer-Cetus. Amplification proceeded through a cycle of 
denaturation for 2 minutes at 94* C, annealing for 2 
minutes at 60* C and polymerisation for 3 minutes ar 72* C. 
Amplified DNA was digested with EcoRI, purified from 
10 agarose gels using Geneclean (BIO 101, San Diego, CA) and 
cloned into the EcoRI restriction site of pBluescript II 
KS+ (pDRl) and the expression vector pXMT2 (pDRlO) . 

A probe specific for the 5* end of the mouse LIF 
open reading frame was constructed by digestion of pDRl 
15 with Smal and religation to give pDR2 which lacks mLIF 
sequences downstream of the mLIF Smal restriction site. 

A cDNA encoding the matrix-associated form of 
DIA/LIF was constructed in the expression vector pXMT2. 
DNA derived by RACE PGR cloning (see below) was removed 
20 from pDRldl by Xhol/Smal digestion and cloned into 
Xhol/Smal cut pDRlO to produce the plasmid pDRll. 

RNA ANALYSIS. High specific activity riboprobes were 
synthesised by in vitro transcription of murine LIF cDNA 
fragments cloned into pBluescript II KS+ (Stratagene) . 
2 5 Antisense probes were generated by transcription with T7 
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polymerase (New England Biolabs) after linearisation of the 
template by digestion with Hindlll (pDRl, pDR2) or Xhol 
(pDRlOO, pDRlOl) . A riboprobe specific for the 3' end of 
the mLIF open reading frame was. produced by transcription 
5 of pDRl with T7 RNA polymerase after linearisation of the 
plasmid by digestion with Mel which cuts at nucleotide 353 
in the mLIF cDNA. The reaction conditions were as 
described by Krieg and Melton (1988) except that 6.25m1 of 
Q-32p.uTP (800 Ci/mmol, 40mCi/ml, Amersham) was used in a 

10 15m1 reaction volume, 

Ribonuclease protection assays were carried out 
essentially as described by Krieg and Melton (1988) . IS/xg 
of cytoplasmic RNA (Edwards et al, 1985) was hybridised 
with 6 X 10^ cpm probe (specific activity 3 x 10^ cpm/^g) 

15 in the absence of added tRNA. Hybridisations were carried 
out at 45 'C for 16-20 hours, RNA hybrids were digested 
with 4 0 ^g/ml RNase A (Boehringer) and 2 /ig/ml RNase Tl 
(BRL) at 4'C for 30 minutes. Protected fragments were 
analysed on a 5% polyacrylamide/ 8M urea sequencing gel 

20 which was dried before autoradiography. 

RACEcDNA CLONING. lO^g of cytoplasmic RNA from Ehrlich 
ascites cells was reverse transcribed from the 
oligonucleotide 5 • . . . ACACGGTACTTGTTGCA . . . 3 ' which 
hybridises to residues 500 - 484 of the murine LIP cDNA. 
25 The RNA was added to 20pmol of primer and 2^1 of 10 x 
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anneal buffer (500mM Tris/HCl pH8,0, 60mM MgCl2, 400iiiM KCl) 
in a total volume of 16^1 # heated to 85 'C for 5 minutes and 
cooled slowly to room temperature. The elongation reaction 
was carried out as described by Frohman et al. (1988) • 
5 Excess oligonucleotide was removed by gel filtration 
through a 2ml Sephacryl S-400 (Pharmacia) column 
equilibrated with O.OSxTE. SO/il fractions corresponding to 
the cDNA radioactive peak were pooled, concentrated by 
vacuum centrifugation and resuspended in 23^il of H20, To 

10 tail the 3 ' end of the cDNA with dG residues 3^1 of lOmM 
dGTP, '6m1 of 5 X tailing buffer (BRL) and 15 units of 
terminal deoxynucleotidyl transferase (BRL) were added and 
the mixture was incubated at 37 'C for 60 minutes and then 
70' C for 15 minutes. After ethanol precipitation the cDNA 

15 template was resuspended in 500^1 of H2O. 

PGR was carried out using a mouse LIF specific 
oligonucleotide 5 • . . .TTCTGGTCCCGGGTGATATTGGTCA, • . 3 • (mLIF 
residues 389 - 365) and an anchor oliognucleotide 
5 • . . . CCATGGCCTCGAGGGCCCCCCCCCCCCCC . • . 3 • . The anchor 

20 oligonucleotide includes the restriction sites Apal, Ncol, 
Xhol and Sfil while the mLIF oliogonucleotide includes the 
unique Smal restriction site at nucleotide 379 in the mLIF 
open reading frame. PGR was carried out in a final volume 
of 50m1 containing 7^1 of the cDNA template and 25 pmol of 

25 each oligonucleotide. Reaction conditions were as 

recommended by Perkin Elmer-Getus except that a final 
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c ncentration of 3nM MgCl2 used. DNA was denatured 
prior to the addition of Taq polymerase (Perkin Elner- 
Cetus) by heating the reaction nixture to 94 'C for 5 
minutes. Each PCR cycle consisted of denaturation for 2 
5 minutes at 94*C, annealing for 2 minutes at 55*C and 
elongation for 3 minutes at 72*C. Products of this 
reaction were analysed by Southern blot hybridisation using 
probes derived from pDRl, pDR2 or the LIFI. oligonucleotide. 
After the final elongation (30 minutes at 72 'C) samples 

10 were ethanol precipitated, digested with Smal and Xhol, and 
analysed by agarose gel electrophoresis. DNA was purified 
from agarose gels using Geneclean and cloned into Xhol/Smal 
digested pBluescript II KS+ (Stratagene) . Suitable 
recombinant plasmids were purified by the alkaline lysis 

15 method. The clone corresponding to the diffusible form of 
DIA/LIF was designated pDRlOO and that corresponding to the 
matrix-associated form of DIA/LIF was designated pDRlOl. 

To sequence the RACE cDNA clones, PDRlOO and pDRlOl 
were digested with Xhol and Smll and the DIA/LIF cDNA 

20 fragments were purified from agarose gels using Geneclean 
and cloned into Sall/Smal digested. M13mpl8 and M13mpl9. 
Plasmids were sequenced with Sequenase Version 2.0 (United 
States Biochemicals, Cleveland) according to the 
manufacturers recommendations. 
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EXAMPLE 1 RESTTT.TS 
IDENTIFICATION OF MATRIX-ASSOCIATED DIA/LIF ACTIVITY 
FRODOCED BY C3H lOTl/2 EMBRYONIC FIBROBLASTS 
The naintainance of ES cells in an undifferentiated 
state can be achieved by growing the cells in nedixun 
conditioned by preincubation with BRL cells. This results 
from secretion of soluble DIA/LIF protein into the medium 
5 by BRL cells. By contrast, medium conditioned by different 
feeder cell lines may be unable to prevent or be relatively 
inefficient at preventing ES cell differentiation. This 
suggests that at least part of the DIA/LIF activity 
associated with feeder cells is not diffusible but is 
10 localised to the cell in some way. This could involve 
attachment to either the feeder cell membrane or 
extracellular matrix. 

The results of an experiment designed to test this 
are illustrated in Figure 1. ES cells cultured in direct 
15 contact with lOTl/2 embryonic fibroblast cells were 
maintained in an undifferentiated state (Figure 1, i) . 
This" confirms that lOTl/2 cells, in common with other 
embryonic fibroblast lines such as STO and 3T3, can act as 
feeder cells for the culture of undifferentiated ES cells. 
20 No significant DIA/LIF activity could be detected in medium 
that had been conditioned by preincubation with lOTl/2 
cells (Figure 1. ii) . Furthermore, physical separation of 
the lOTl/2 feeder cells and ES cells by a microporous 
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membrane, which permits the passage of diffusible 
macromolecules but prevents direct cell-cell contact, 
resulted in differentiation of the ES cells (Figure 1^ 
iii) . The requirement for physical contact between lOTl/2 
5 feeder and ES cells and the absence of any secreted, 

diffusible DIA/LIF activity argues for the existence of a 
localised form of the DIA/LIF protein on lOTl/2 cells. 

To identify the location of this accivity lCTl/2 
cells were removed from the culture by treatment with 

10 either ammonium hydroxide, which causes- osmotic lysis, or 
EDTA, which results in the detachment of intact cells from 
the extracellular matrix. Since cells grow above the 
secreted extracellular matrix in vitro , the effect of these 
treatments was to yield a substrate composed almost 

15 exclusively of extracellular matrix. ES cells seeded onto 
this substrate were maintained in an undifferentiated state 
(Figure 1, iv, v) as judged by their overt morphology or 
the presence of the SSEA-1 antigen. 

This result established the existence of a matrix- 

20 associated DIA/LIF activity on lOTl/2 fibroblasts. 
Identical experiments failed to detect an equivalent 
-acLivity on the extracellular matrix of MRC-5 human 
fibroblast cells which secrete the soluble form of DIA/LIF 
into the medium. The matrix-associated DIA/LIF activity of 

25 lOTl/2 cells must therefore be a specific activity produced 
by these cells and not a non-specific effect of the 
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extracellular matrix. Preliminary purification and acid 
stability characteristics suggest that this activity is 
very closely related to the known DIA/LIF glycoprotein. 

IDENTIFICATION OF A TRANSCRIPT ENCODING THE MATRIX 
5 ASSOCIATED FORM OF DIA/LIF 

The probe xised for Northern blots and the initial 
ribonuclease protections was derived from a cDNA fragment 
that contained the entire murine LIF (mUF) open reading 
frame as defined by Gearing et al. (1987). This cDNA was 

10 obtained by PCR on RNA from Ehrlich ascites cells, which had 
previously been shown to secrete solxible DIA/LIF. To test 
for biological activity the mLIF fragment was cloned into 
the unique EcoRI restriction site of pXMT2, an expression 
vector which contains the SV40 origin of replication. 

15 Transfection of this molecule, pDRlO, into cos7 cells and 
bioassay of conditioned medium by its ability to prevent ES 
cell differentiation established that the biological 
activity of this clone was found to be equivalent to that 
of the human LIF gene described by Moreau et al. (1988) 

20 The mLIF cDNA was also cloned into the unique EcoRI 

restriction site of pBluescript II KS to give pDRl. This 
plasmid was transcribed in vitro with T7 RNA polymerase in 
the presence of a-^2p.u»j.p ^^s used to produce radiolabelled 
antisense RNA probes of high specific activity for use in 

2 5 both Northern blots and ribonuclease protections. 



wo 91/13985 PCT/GB91/00334 

- 21 - 

Northern blot analysis of total and poly /A+ RNA 
from Ehrich ascites (EA) and BRL cells revealed a single 
transcript in each cell type which varied in size from 4]cb 
in the mouse cells to 4.5kb in the Buffalo rat cells 
5 (Figure 2). We have been un£ible to detect alternative LIF 
transcripts of 1.8kb (Moreau et al 1988) or 0.7kb (Gough et 
al 1988) by Northern blot despite having screened a variety 
of cell line and embryonic RNAs. 

Total RNA from Ehrlich ascites cells was also 

10 screened for DIA/LIF expression by ribonuclease protection. 
Two transcripts were dete^cted by this technique (Figure 
3A) . The similar sizes of the two protected bands argues 
that they are identical throughout the majority of the mLIF 
coding region since the ribonuclease protection technique 

15 identifies differences between probe and substrate by 
cleavage at the region of dissimilarity. To determine 
whether the divergence between these two messages was at 
the 5' or the 3' end of the mLIF coding region, riboprobes 
specific for either end of the open reading frame were 

20 produced. Of these, only the 5» probe protected two 

different transcripts (Figure 3B) . This establishes that 
the site of divergence between the transcripts occurs at 
the 5' end of the mLIF open reading frame, 20+/"5 
nucleotides into the coding region. This coincides with 

25 the beginning of the second mLIF exon as deduced from the 
genomic human LIF sequence and the partial genomic mLIF 
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sequence. 

The identity of the two transcripts was established 
by analysis of DIA/LIF transcription in two enbryonic 
fibroblast feeder cell lines, lOTl/2 and STO, as well as in 
5 EA and PYS-2 (a parietal yolk sac cell line) (Figure 4A) . 
The upper transcript, D, corresponds to the secreted and 
diffusible form of the DIA/LIF protein since it shares 
complete homology with the riboprobe throughout the LIF 
coding region. This identification is consistent with the 

10 presence of significant levels of this" transcript in RNA 
from Ehrlich ascites and STO cells which secrete DIA/LIF 
activity but not in lOTl/2 cells which do not secrete 
DIA/LIF activity. The identity of the lower transcript, M, 
could be deduced from a comparison -of the ribonuclease 

15 protection pattern and known biological properties of the 
cell lines (Figure 4B) . The appearance of this transcript 
in both STO and lOTl/2 cells suggests that it encodes the 
matrix-associated form of the DIA/LIF protein which allows 
these cell lines to act as ES cell feeders. This 

20 correlation is most apparent in lOTl/2 cells where only the 

o 

lower transcript is present at significant levels and where 
only the matrix-associated foim of the protein can be 
detected. 

These results suggested that the production of 
25 matrix-associated and diffusible forms of the DIA/LIF 
protein is governed by alternative splicing of upstream 
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exons to the known second and third exons of the mUF gene. 

RACE PGR CLOHING OF CDNA ENCODING THE MATRIX- r . 
ASSOCIATED FORM OF DIA/LIP 
5 The race PCR technique was used to clone the 

unknown 5« sequences of the mRNA encoding the matrix- 
associated form of DIA/LIF. ; This technique can be used 
whenever sequence within the open reading frame is known 
and can be modified to clone sequences either 5» or 3' of 

10 the known sequence, Ribonuclease protections had shown 

that the matrix-associated form of DIA/LIF is identical to 
the solxible form of the protein throughout exons 2 and 3, 
An oligonucleotide complementary to residues 500 - 484 of 
the mLIF cDNA (Gearing et al., 1988) was used to prime cDNA 

15 synthesis from Ehrlich ascites total RNA. The cDNA was 
tailed with dGTP residues at the 3» end using terminal 
deoxynucleotide transferase and the resultant single 
stranded DNA was used as a template for PGR using an 
oligonucleotide complement:iry to residues 389-365 of the 

20 mLIF sequence and an anchor oligonucleotide, 

5 • . . . CCATGGCCTCGAGGGCCCCCCCCC. • • 3 ' . The internal 
• oligonucleotide was chosen both to increase the specificity 
of the PGR reaction and because it incorporated the unique 
Smal restriction site of the mLIF open reading frame. This 

25 was used later in reconstruction of the complete open 
reading frame. 
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THE PGR did not yield the expected two bands but 
rather a smear of material between 400 and 600 nucleotides 
which was homologous to mLIF as ascertained by Southern 
blot. The products of an entire PGR reaction were digested 
5 with.XhoI and Smal and separated on a 2% agarose gel. 

Zones of DNA corresponding to mLIF sequences were excised 
from the gel and cloned into Xhol/Snal cut pBluescript II 
KS+s. Recombinant colonies were screened for the size of 
the inserted fragment and the largest inserts among two 

10 distinct size classes were subcloned into M13 and 

sequenced. The sequences of these two clones upstream of 
residue 37 are shown in Figure 5 with the deduced aunino 
acid sequences of the respective proteins. The larger cDNA 
(pDRlOO) encodes the diffusible form of the DIA/LIF 

15 protein. The sequence shown in Figure 5 is identical to 
that of Gearing et al (1988) but extends 122 nucleotides 
upstream of the ATG initiation codon. This may represent 
the normal 5' end of the mRNA encoding the diffusible form 
of the DIA/LIF protein, 

20 The smaller clone (pDRlOl) diverges from the known 

mLIF sequence at precisely the point deduced to be the 
start of the second exon and is therefore a good candidate 
to encode the matrix-associated form of the DIA/LIF 
protein. The amino acid sequence of the protein encoded by 

25 this cDNA differs from that of the diffusible from of 
DIA/LIF in that the first part of the leader sequence. 
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MKVLAAG, is replaced with the sequence VTACR. The identity 
of the ATG initiation codon is confirmed by the presence of 
an in-frame termination codon, UAG, 30 nucleotides upstream 
of the ATG. The. matrix-associated protein retains the 
5 signal sequence of the diffusible mLIF protein between 

amino acid 8 and the cleavage site after residue 23. This 
leader sequence contains the hydrophobic core sequence that 
is required for translocation of the LIF proteins across 
the cell membrane. The sequence of the pDRlOl cDNA extends 
10 31 nucleotides upstream of the ATG. This may represent the 
5' end of the RNA encoding the matrix-associated DIA/LIF 
protein. 

To confirm the identities of the two RACE clones 
riboprobes derived from pDRlOO and pDRlOl were used to 

15 protect cytoplasmic RNA from Ehrlich ascites cells. 

Antisense riboprobes extending from the 5* ends of the 
pDRlOO and pDRlOl cDNAs to the Smal restriction site within 
the mLIF open reading frame were produced by in vitro 
transcription of these clones. The protection patterns 

20 obtained with these riboprobes were compared with the known 
pattern generated by the 5' riboprobe derived from pDR2. 

The results and interpretation of this experiment 
are shown in Figure 6. Each riboprobe protected a fragment 
of 34 5 nucleotides, the distance between the 5* end of mLIF 

25 exon 2 and the Smal restriction site- This band (c) 

therefore corresponds to transcript M for probes pDR2 and 
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pDRlOO, and transcript D for probe pDRlOl. Probes pDRlOO 
and PDRlOl both protected fragments larger than those 
protected by probe pDR2 (345 and 369 nucleotides). This 
establishes that the sequences upstream of the ATG in these 
5 RACE clones are present in the cellular D"and M transcripts 
respectively, and confirms the identification of clone 
pDRlOO as a cDNA encoding the diffusible form of DIA/LIF. 
The presence of two protected bands larger than 369 
nucleotides suggests that the promoters directing 
10 expression of the two DIA/LIF transcripts each contain two 
RNA initiation sites. The deduced positions of these sites 
are indicated in Figure 5. 
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CLAIMS 

1. A siobstantially pure LexxJcaemia Inhibitory Factor 
(LIF) which is capable of associating with a namnalian 
extracellular matrix. 

2. A LIF according to claim 1 wherein the N-terminal 
sequence comprises: 

NH2-U-arg-X-arg- 
where U is the residue of a peptide of 20 or fewer amino 
acids, or a covalent bond, and X is any amino acid. 

3. A LIF according to claim 2 wherein U represents the 
residue of methioneine and X is cys orgly. 

4. A LIF according to either of claims 2 or 3 wherein 
the N terminal sequence comprises: 

NH2-U-arg-X-arg-ile-val-pro-leu-leu- 
where U and X are as defined in claim 2, and substantially 
identical variants thereof. 

5. - A recombinant DNA encoding the LIF of any one of 
claims 1 to 4 • 

6. A recombinant replicable expression vector 
comprising the DNA of claim 5. 



I 
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7- A host cell transformed or transfected with the 

vector of claim 6. 

8. A polyclonal or monoclonal antibody directed 
against the N-terminal of a LIF according co any one of 
claims 1 to 4, or produced by a host cell according to 
claim 7. 

9. A pharmaceutical composition comprising a LIF 
according to any one of claims 1 to 4 or any antibody 
according to claim 8 in association with a pharacologically 
acceptable carrier or diluent. 

10. A recombinant protein or peptide which comprises, 
at its N-terminus, the sequence: 

NH2-U-arg-X-arg- 
where U is the residue of a peptide of 20 or fewer amino 
acids, or a covalent bond, and X is any amino acid. 

11. A recombinant protein of peptide according to 
either. of claims 9 or 10 wherein the N terminal sequence 
comprises: 

NH2-U-arg-X-arg-ilo-val-pro-leu-leu- 
where U and X are as defined in claim 8, and substantially 
identical variants thereof. 
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12. A recombinant protein or peptide according to claim 
10 or 11 which is a hormone, growth factor or cytokine. 

13. A recombinant DNA encoding a protein or peptide 
according to any one of claims 9 to 12* 

14. A recombinant replicablt expression vector 
comprising the DNA of claim _13. 

15. A host cell transformed or transfected with the 
vector of claim 14. 

16. A polyclonal or monoclonal antibody directed 
against the N-terminal of a protein or peptide according to 
any one of claims 9 to 14, or produced by a host cell 
according to claim 15. 

17. A peptide of the formula: 

NH2-U-arg-X-arg-ile-val-pro-leu-leu-C00H 
where U is the residue of a peptide of 20 or fewer amino 
acids or a covalent bond and X is any amino acid, and 
substantially identical variants thereof. 
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18. A method of causing a pro.tein or peptide to become 
attached to the ECM of a mammalian cell comprising 
contacting the ECM with a protein or peptide having an N- - 
terminus of the "sequence: t 

NH2 -U-arg-X-arg- 
where U is the residue of a peptide of 20 or fever amino 
acids or a covalent bond and X is any 2unino acid. 

19. A method according to claim 18 wherein the N- 
terminal of the protein or peptide comprises: 

NH2-U-arg-X-arg-ile-val-pro-leu-leu- 
where U and X are as defined in claim 18, and substantially 
identical variants thereof. 

20. The use of LIF according to any one of claims 1 to 
4 or an antibody according to claim 8 in method of 
treatment of the human or animal body by therapy or 
diagnosis. 

21. A method of therapy of conditions associated with 
abnormal and/or aberrant expression of LIF in a mammal . 
including man, which will benefit from an alteration in 
naturally occuring levels of LIF which comprises 
administering to a recipient in need of such therapy an 
effective amount of a LIF according to claim 1 or an 
antibody according to claim 8. 
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ALTERNATIVE FORMS OF DIA/LIF 

CTAQTCCCTQGAAAGCTQTQATT66CQC6A 6ATGAGATGCAQGATT MATRIX 

GGAGTCCAQCCCAT AATGAAGGTCTTGGCCGCAGGGATT DIFFUSIBLE 
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MKVLAAG^IVPLLLLVLH DIFFUSIBLE 
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